
Data Preview

Tail

| index | longitude | latitude | housing_median_age | total_rooms | total_bedrooms | population | households | median_income | rooms_per_household | population_per_household | bedrooms_per_room | 1H OCEAN | INLAND | ISLAND | NEAR BAY | NEAR OCEAN | median_house_value |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 16507 | -0.091860 | -0.570036 | -0.524113 | -0.007113 | -0.232687 | -0.350198 | -0.200708 | 0.732561 | 0.255372 | -0.063783 | -0.610777 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 1.000000 | 1.166508 |
| 16508 | -0.680633 | 1.421510 | -1.479923 | 0.487642 | 0.109550 | 0.322396 | 0.175824 | 0.775757 | 0.365321 | 0.004851 | -0.747404 | 0.000000 | 1.000000 | 0.000000 | 0.000000 | 0.000000 | 0.084283 |
| 16509 | -0.196641 | 0.584686 | -0.205509 | -0.729033 | -0.814490 | -0.615781 | -0.729720 | -0.711357 | -0.155025 | 0.020018 | -0.420664 | 0.000000 | 1.000000 | 0.000000 | 0.000000 | 0.000000 | -1.251668 |
| 16510 | 0.891092 | -0.873911 | -0.842716 | 0.559948 | -0.144276 | 0.124828 | -0.122912 | 1.753089 | 1.057276 | 0.034776 | -1.216717 | 1.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.716012 |
| 16511 | 1.170510 | -0.710286 | -0.364811 | -0.775149 | -0.612000 | -0.859771 | -0.938213 | -1.523681 | 0.431790 | 0.013931 | 0.663432 | 0.000000 | 1.000000 | 0.000000 | 0.000000 | 0.000000 | -1.034188 |

Pre Processing

Pre Processing - Imputations

Pre Processing - Imputations - Missing

No missing values in data

Pre Processing - Imputations - Infinities

No Infinity values
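
The two checks above (missing values and infinite values) can be reproduced with a few lines of pandas. This is a minimal sketch, not the report generator's own code: the function name `missing_and_infinite` and the demo frame are hypothetical and used only for illustration.

```python
import numpy as np
import pandas as pd


def missing_and_infinite(df: pd.DataFrame) -> pd.DataFrame:
    """Count NaN and +/-inf entries per column (hypothetical helper)."""
    # Infinity is only meaningful for numeric columns.
    numeric = df.select_dtypes(include="number")
    infinite = np.isinf(numeric).sum().reindex(df.columns, fill_value=0)
    return pd.DataFrame({"missing": df.isna().sum(), "infinite": infinite})


# Tiny illustrative frame (not the housing data).
demo = pd.DataFrame({"a": [1.0, np.nan, np.inf], "b": [1.0, 2.0, 3.0]})
report = missing_and_infinite(demo)
print(report)
```

A frame that passes both checks, as the data in this report does, would show zeros in both columns for every feature.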

Health Analysis

Health Plot

Missing Plot

Missing Value Summary

No Missing Values

Duplicate Columns

No duplicate variables

Outliers In Features

Data Shape: (16512, 17)
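| feature | < (mean - 3 * std) | > (mean + 3 * std) | < (1stQ - 1.5 * IQR) | > (3rdQ + 1.5 * IQR) | -inf | +inf |
|---|---|---|---|---|---|---|
| total_rooms | 0 | 460 | 0 | 1038 | 0 | 0 |
| total_bedrooms | 0 | 452 | 0 | 1073 | 0 | 0 |
| population | 0 | 425 | 0 | 963 | 0 | 0 |
| households | 0 | 463 | 0 | 988 | 0 | 0 |
| median_income | 0 | 322 | 0 | 555 | 0 | 0 |
| rooms_per_household | 0 | 110 | 39 | 384 | 0 | 0 |
| population_per_household | 0 | 9 | 7 | 566 | 0 | 0 |
| bedrooms_per_room | 0 | 175 | 10 | 514 | 0 | 0 |
| ISLAND | 0 | 4 | 0 | 4 | 0 | 0 |
| NEAR BAY | 0 | 0 | 0 | 1857 | 0 | 0 |
| NEAR OCEAN | 0 | 0 | 0 | 2114 | 0 | 0 |
| median_house_value | 0 | 0 | 0 | 874 | 0 | 0 |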
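
The column headers above name two common outlier rules: values more than three standard deviations from the mean, and values beyond 1.5 times the interquartile range from the quartiles. The sketch below shows one way such counts could be computed. It assumes the sample standard deviation and linearly interpolated quartiles (the pandas defaults), which may differ from what the report generator uses; `count_outliers` and the demo frame are hypothetical.

```python
import numpy as np
import pandas as pd


def count_outliers(df: pd.DataFrame) -> pd.DataFrame:
    """Per-column counts outside the 3-sigma and 1.5*IQR fences (hypothetical helper)."""
    rows = []
    for col in df.select_dtypes(include="number").columns:
        s = df[col]
        finite = s[np.isfinite(s)]  # drop NaN and +/-inf before computing fences
        mean, std = finite.mean(), finite.std()  # std uses ddof=1 (pandas default)
        q1, q3 = finite.quantile(0.25), finite.quantile(0.75)
        iqr = q3 - q1
        rows.append({
            "feature": col,
            "below_3std": int((finite < mean - 3 * std).sum()),
            "above_3std": int((finite > mean + 3 * std).sum()),
            "below_iqr": int((finite < q1 - 1.5 * iqr).sum()),
            "above_iqr": int((finite > q3 + 1.5 * iqr).sum()),
            "neg_inf": int((s == -np.inf).sum()),
            "pos_inf": int((s == np.inf).sum()),
        })
    return pd.DataFrame(rows).set_index("feature")


# Tiny illustrative frame: "x" has one extreme value, "y" is a rare 0/1 indicator.
demo = pd.DataFrame({"x": [1.0, 2.0, 3.0, 4.0, 100.0],
                     "y": [0.0, 0.0, 0.0, 0.0, 1.0]})
result = count_outliers(demo)
print(result)
```

The indicator column `y` illustrates why one-hot columns such as NEAR BAY and NEAR OCEAN can show large IQR counts: when more than 75% of the values are 0, both quartiles are 0, the IQR is 0, and every 1 falls above the upper fence. Under that reading, those counts reflect class frequency rather than anomalous rows.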

Feature Analysis

Summary Stats

Summary Stats - Numeric Variables

| index | Variable Name | Datatype | No of Unique | Samples | Mean | Standard Deviation | Min | 25th percentile | Median | 75th percentile | Max |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | bedrooms_per_room | float64 | 15318 | [-0.41497843993750105, 0.22433683411069402, 0.7881036630378417, -0.3417164754449367, -0.695387293175209] | -0.022533 | 0.817598 | -2.529170 | -0.577699 | -0.171339 | 0.392727 | 3.000091 |
| 1 | households | float64 | 1352 | [0.9039936245233167, -0.8044038678104428, -0.7203843190071432, 0.6457113078316918, -0.5990227485134881] | -0.013890 | 0.953064 | -1.517014 | -0.651924 | -0.247385 | 0.365646 | 3.000091 |
| 2 | housing_median_age | float64 | 52 | [0.4316979713287185, 0.5909997469442371, -0.36481090674887434, 0.033443532289922084, -1.878177775096301] | -0.000000 | 1.000030 | -2.196781 | -0.842716 | 0.033444 | 0.670651 | 1.865414 |
| 3 | latitude | float64 | 842 | [-0.5279613615285398, -0.8037858047209856, 1.6786341840110133, 1.3280098918172283, -1.2806348421045346] | -0.000000 | 1.000030 | -1.453609 | -0.799111 | -0.644836 | 0.968036 | 2.945557 |
| 4 | longitude | float64 | 827 | [1.1405723234300162, 0.7713417323858928, -0.2165995787862045, -0.9550607608744446, 1.2104267595734985] | 0.000000 | 1.000030 | -2.352149 | -1.109738 | 0.531841 | 0.786311 | 2.632463 |
| 5 | median_house_value | float64 | 3686 | [-1.1481062913939337, -0.39296558926643954, -0.9323518050717926, -1.1144485915276798, 0.06270788584592268] | 0.000000 | 1.000030 | -1.659885 | -0.757159 | -0.232013 | 0.503710 | 2.525770 |
| 6 | median_income | float64 | 10633 | [-0.3820225088420924, -0.3624963041011309, -1.1781694498209412, -0.6242269147423309, 1.2786130897089703] | -0.004798 | 0.984889 | -1.893941 | -0.722653 | -0.171236 | 0.514466 | 3.000091 |
| 7 | population | float64 | 3245 | [1.0198199229933904, -0.24331707709391862, -0.6902732463555817, 1.00038704606897, -0.35343671299896606] | -0.015879 | 0.945213 | -1.497817 | -0.653567 | -0.243317 | 0.359102 | 3.000091 |
| 8 | population_per_household | float64 | 15078 | [-0.00707667237316579, 0.23554927313295906, -0.015906643580314822, 0.03183087287395963, 0.06259562181095774] | -0.013743 | 0.123808 | -0.277551 | -0.073336 | -0.027475 | 0.024575 | 3.000091 |
| 9 | rooms_per_household | float64 | 15356 | [0.148395551124925, -0.37289896407022516, 0.34675430424695863, -0.10140573771899154, 0.021473018803055254] | -0.033411 | 0.590853 | -1.947346 | -0.413274 | -0.075139 | 0.255975 | 3.000091 |
| 10 | total_bedrooms | float64 | 1465 | [0.8938437911929445, -0.814490285014096, -0.2811706318409465, 0.47175123146232345, -0.7203750521011874] | -0.014762 | 0.949733 | -1.493261 | -0.649076 | -0.258355 | 0.340560 | 3.000091 |
| 11 | total_rooms | float64 | 5046 | [1.0979717079285942, -0.8651047301484527, -0.5479836234173288, 0.5929691555578459, -0.5383048822783179] | -0.018061 | 0.936982 | -1.456647 | -0.637939 | -0.250220 | 0.336198 | 3.000091 |

Summary Stats - Non Numeric Variables

| index | Variable Name | Datatype | No of Unique | Samples | Mode | Mode Freq | Mode Freq % |
|---|---|---|---|---|---|---|---|
| 0 | NEAR BAY | float64 | 2 | [0.0, 1.0] | 0.000000 | 14655.000000 | 88.753634 |
| 1 | ISLAND | float64 | 2 | [0.0, 1.0] | 0.000000 | 16508.000000 | 99.975775 |
| 2 | NEAR OCEAN | float64 | 2 | [0.0, 1.0] | 0.000000 | 14398.000000 | 87.197190 |
| 3 | INLAND | float64 | 2 | [1.0, 0.0] | 0.000000 | 11283.000000 | 68.332122 |
| 4 | 1H OCEAN | float64 | 2 | [0.0, 1.0] | 0.000000 | 9204.000000 | 55.741279 |

Distributions

Distributions - Numeric Variables

Distributions - Numeric Variables - 1h Ocean

Distributions - Numeric Variables - Inland

Distributions - Numeric Variables - Island

Distributions - Numeric Variables - Near Bay

Distributions - Numeric Variables - Near Ocean

Distributions - Numeric Variables - Bedrooms Per Room

Distributions - Numeric Variables - Households

Distributions - Numeric Variables - Housing Median Age

Distributions - Numeric Variables - Latitude

Distributions - Numeric Variables - Longitude

Distributions - Numeric Variables - Median House Value

Distributions - Numeric Variables - Median Income

Distributions - Numeric Variables - Population

Distributions - Numeric Variables - Population Per Household

Distributions - Numeric Variables - Rooms Per Household

Distributions - Numeric Variables - Total Bedrooms

Distributions - Numeric Variables - Total Rooms

Distributions - Non Numeric Variables

No categorical variables in data.

Feature Normality

Feature Interactions

Correlation Table

Variable 1 Variable 2 Corr Coef Abs Corr Coef
0 households total_bedrooms 0.972292 0.972292
1 latitude longitude -0.923566 0.923566
2 total_bedrooms total_rooms 0.917576 0.917576
3 households total_rooms 0.913138 0.913138
4 households population 0.907582 0.907582
5 population total_bedrooms 0.870947 0.870947
6 population total_rooms 0.841216 0.841216
7 bedrooms_per_room rooms_per_household -0.735781 0.735781
8 median_house_value median_income 0.692286 0.692286
9 bedrooms_per_room median_income -0.658586 0.658586
10 1H OCEAN INLAND -0.606608 0.606608
11 median_income rooms_per_household 0.587385 0.587385
12 INLAND median_house_value -0.487042 0.487042
13 NEAR BAY longitude -0.474872 0.474872
14 1H OCEAN latitude -0.447990 0.447990
15 housing_median_age total_rooms -0.385412 0.385412
16 NEAR BAY latitude 0.358199 0.358199
17 INLAND latitude 0.350984 0.350984
18 1H OCEAN NEAR OCEAN -0.341438 0.341438
19 housing_median_age total_bedrooms -0.332247 0.332247
20 1H OCEAN longitude 0.321373 0.321373
21 1H OCEAN NEAR BAY -0.317193 0.317193
22 households housing_median_age -0.313191 0.313191
23 housing_median_age population -0.312313 0.312313
24 bedrooms_per_room median_house_value -0.274113 0.274113
25 median_house_value rooms_per_household 0.272339 0.272339
26 INLAND NEAR OCEAN -0.260855 0.260855
27 1H OCEAN median_house_value 0.258949 0.258949
28 INLAND median_income -0.249049 0.249049
29 NEAR BAY housing_median_age 0.246640 0.246640
30 INLAND NEAR BAY -0.242332 0.242332
31 median_income total_rooms 0.241309 0.241309
32 INLAND housing_median_age -0.231686 0.231686
33 bedrooms_per_room total_rooms -0.228248 0.228248
34 rooms_per_household total_rooms 0.227762 0.227762
35 housing_median_age rooms_per_household -0.224339 0.224339
36 median_house_value population_per_household -0.195367 0.195367
37 1H OCEAN median_income 0.181885 0.181885
38 INLAND rooms_per_household 0.180733 0.180733
39 population population_per_household 0.174524 0.174524
40 median_house_value total_rooms 0.161998 0.161998
41 NEAR OCEAN latitude -0.160620 0.160620
42 NEAR BAY median_house_value 0.159617 0.159617
43 population_per_household total_bedrooms -0.159056 0.159056
44 latitude median_house_value -0.145156 0.145156
45 bedrooms_per_room housing_median_age 0.144941 0.144941
46 households population_per_household -0.143922 0.143922
47 housing_median_age median_income -0.143620 0.143620
48 NEAR OCEAN median_house_value 0.141237 0.141237
49 population_per_household total_rooms -0.136965 0.136965
50 NEAR BAY NEAR OCEAN -0.136400 0.136400
51 latitude rooms_per_household 0.135265 0.135265
52 latitude population -0.123225 0.123225
53 households rooms_per_household -0.121681 0.121681
54 1H OCEAN rooms_per_household -0.121175 0.121175
55 longitude population_per_household 0.119223 0.119223
56 bedrooms_per_room latitude -0.117976 0.117976
57 bedrooms_per_room total_bedrooms 0.114338 0.114338
58 INLAND bedrooms_per_room -0.112131 0.112131
59 population rooms_per_household -0.111648 0.111648
60 latitude population_per_household -0.111575 0.111575
61 housing_median_age longitude -0.106657 0.106657
62 longitude population 0.106436 0.106436
63 NEAR BAY population_per_household -0.106074 0.106074
64 housing_median_age median_house_value 0.100225 0.100225
65 bedrooms_per_room longitude 0.098285 0.098285
66 1H OCEAN population 0.092779 0.092779
67 1H OCEAN population_per_household 0.087572 0.087572
68 latitude median_income -0.081398 0.081398
69 households median_house_value 0.076849 0.076849
70 households latitude -0.074037 0.074037
71 1H OCEAN bedrooms_per_room 0.070837 0.070837
72 bedrooms_per_room households 0.070113 0.070113
73 NEAR OCEAN population_per_household -0.066079 0.066079
74 latitude total_bedrooms -0.064269 0.064269
75 NEAR BAY population -0.060512 0.060512
76 longitude total_bedrooms 0.060031 0.060031
77 longitude rooms_per_household -0.059944 0.059944
78 median_house_value total_bedrooms 0.058884 0.058884
79 INLAND households -0.058200 0.058200
80 NEAR BAY median_income 0.058195 0.058195
81 INLAND longitude -0.053232 0.053232
82 households longitude 0.049718 0.049718
83 NEAR OCEAN bedrooms_per_room 0.047807 0.047807
84 longitude median_house_value -0.047527 0.047527
85 1H OCEAN households 0.046603 0.046603
86 1H OCEAN housing_median_age 0.045427 0.045427
87 NEAR OCEAN longitude 0.044954 0.044954
88 median_income population_per_household -0.044916 0.044916
89 bedrooms_per_room population 0.043498 0.043498
90 NEAR OCEAN rooms_per_household -0.042192 0.042192
91 rooms_per_household total_bedrooms -0.040378 0.040378
92 population_per_household rooms_per_household -0.040005 0.040005
93 INLAND population -0.039535 0.039535
94 NEAR BAY rooms_per_household -0.031082 0.031082
95 longitude total_rooms 0.029700 0.029700
96 latitude total_rooms -0.028055 0.028055
97 median_house_value population -0.027907 0.027907
98 INLAND population_per_household 0.026244 0.026244
99 NEAR OCEAN population -0.025143 0.025143
100 INLAND total_bedrooms -0.021976 0.021976
101 NEAR OCEAN median_income 0.021780 0.021780
102 NEAR OCEAN housing_median_age 0.021160 0.021160
103 ISLAND median_house_value 0.020919 0.020919
104 longitude median_income -0.019274 0.019274
105 households median_income 0.018859 0.018859
106 1H OCEAN total_bedrooms 0.018219 0.018219
107 ISLAND bedrooms_per_room 0.018139 0.018139
108 ISLAND latitude -0.016642 0.016642
109 ISLAND housing_median_age 0.014159 0.014159
110 1H OCEAN ISLAND -0.013871 0.013871
111 NEAR BAY total_rooms -0.013491 0.013491
112 INLAND total_rooms 0.012853 0.012853
113 NEAR OCEAN total_bedrooms 0.011489 0.011489
114 housing_median_age latitude 0.011252 0.011252
115 ISLAND population -0.011129 0.011129
116 NEAR OCEAN households 0.010909 0.010909
117 INLAND ISLAND -0.010597 0.010597
118 ISLAND median_income -0.009703 0.009703
119 ISLAND longitude 0.009502 0.009502
120 ISLAND households -0.009062 0.009062
121 median_income total_bedrooms -0.008842 0.008842
122 NEAR BAY total_bedrooms -0.008299 0.008299
123 ISLAND total_rooms -0.007695 0.007695
124 ISLAND population_per_household -0.007296 0.007296
125 median_income population 0.007153 0.007153
126 housing_median_age population_per_household 0.006546 0.006546
127 bedrooms_per_room population_per_household 0.006252 0.006252
128 ISLAND NEAR OCEAN -0.005965 0.005965
129 ISLAND NEAR BAY -0.005541 0.005541
130 NEAR OCEAN total_rooms -0.004108 0.004108
131 ISLAND total_bedrooms -0.002801 0.002801
132 NEAR BAY bedrooms_per_room 0.002283 0.002283
133 ISLAND rooms_per_household 0.001949 0.001949
134 NEAR BAY households 0.001336 0.001336
135 1H OCEAN total_rooms -0.000451 0.000451
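
The table above lists each unordered pair of the 17 columns once (17 * 16 / 2 = 136 rows, indexed 0 to 135), sorted by absolute correlation. A table of that shape could be built as sketched below. The sketch assumes Pearson correlation, which is the pandas default; the report does not state which coefficient it uses. `correlation_table` and the demo frame are hypothetical.

```python
from itertools import combinations

import pandas as pd


def correlation_table(df: pd.DataFrame) -> pd.DataFrame:
    """Long-format pairwise correlations sorted by absolute value (hypothetical helper)."""
    corr = df.corr()  # Pearson by default; an assumption about the report
    rows = []
    # Sorting names first keeps each pair in a stable (Variable 1, Variable 2) order.
    for a, b in combinations(sorted(corr.columns), 2):
        coef = float(corr.loc[a, b])
        rows.append((a, b, coef, abs(coef)))
    table = pd.DataFrame(
        rows, columns=["Variable 1", "Variable 2", "Corr Coef", "Abs Corr Coef"]
    )
    return table.sort_values("Abs Corr Coef", ascending=False).reset_index(drop=True)


# Tiny illustrative frame: b is an exact multiple of a, c alternates in sign.
demo = pd.DataFrame({"a": [1.0, 2.0, 3.0, 4.0],
                     "b": [2.0, 4.0, 6.0, 8.0],
                     "c": [1.0, -1.0, 1.0, -1.0]})
table = correlation_table(demo)
print(table)
```

Sorting by the absolute value rather than the signed coefficient is what places strong negative pairs such as latitude and longitude near the top alongside strong positive ones.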

Correlation Heatmap

Covariance Heatmap

Bivariate Plots (top 50 Correlations)

Bivariate Plots (top 50 Correlations) - Total Bedrooms Vs Households

Bivariate Plots (top 50 Correlations) - Total Rooms Vs Total Bedrooms

Bivariate Plots (top 50 Correlations) - Total Rooms Vs Households

Bivariate Plots (top 50 Correlations) - Population Vs Households

Bivariate Plots (top 50 Correlations) - Total Bedrooms Vs Population

Bivariate Plots (top 50 Correlations) - Total Rooms Vs Population

Bivariate Plots (top 50 Correlations) - Median Income Vs Median House Value

Bivariate Plots (top 50 Correlations) - Rooms Per Household Vs Median Income

Bivariate Plots (top 50 Correlations) - Latitude Vs Near Bay

Bivariate Plots (top 50 Correlations) - Latitude Vs Inland

Bivariate Plots (top 50 Correlations) - Longitude Vs 1h Ocean

Bivariate Plots (top 50 Correlations) - Rooms Per Household Vs Median House Value

Bivariate Plots (top 50 Correlations) - Median House Value Vs 1h Ocean

Bivariate Plots (top 50 Correlations) - Housing Median Age Vs Near Bay

Bivariate Plots (top 50 Correlations) - Total Rooms Vs Median Income

Bivariate Plots (top 50 Correlations) - Total Rooms Vs Rooms Per Household

Bivariate Plots (top 50 Correlations) - Median Income Vs 1h Ocean

Bivariate Plots (top 50 Correlations) - Rooms Per Household Vs Inland

Bivariate Plots (top 50 Correlations) - Population Per Household Vs Population

Bivariate Plots (top 50 Correlations) - Total Rooms Vs Median House Value

Bivariate Plots (top 50 Correlations) - Median House Value Vs Near Bay

Bivariate Plots (top 50 Correlations) - Housing Median Age Vs Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Median House Value Vs Near Ocean

Bivariate Plots (top 50 Correlations) - Rooms Per Household Vs Latitude

Bivariate Plots (top 50 Correlations) - Population Per Household Vs Longitude

Bivariate Plots (top 50 Correlations) - Total Bedrooms Vs Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Population Vs Longitude

Bivariate Plots (top 50 Correlations) - Median House Value Vs Housing Median Age

Bivariate Plots (top 50 Correlations) - Longitude Vs Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Population Vs 1h Ocean

Bivariate Plots (top 50 Correlations) - Population Per Household Vs 1h Ocean

Bivariate Plots (top 50 Correlations) - Median House Value Vs Households

Bivariate Plots (top 50 Correlations) - Bedrooms Per Room Vs 1h Ocean

Bivariate Plots (top 50 Correlations) - Households Vs Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Total Bedrooms Vs Longitude

Bivariate Plots (top 50 Correlations) - Total Bedrooms Vs Median House Value

Bivariate Plots (top 50 Correlations) - Median Income Vs Near Bay

Bivariate Plots (top 50 Correlations) - Longitude Vs Households

Bivariate Plots (top 50 Correlations) - Bedrooms Per Room Vs Near Ocean

Bivariate Plots (top 50 Correlations) - Households Vs 1h Ocean

Bivariate Plots (top 50 Correlations) - Housing Median Age Vs 1h Ocean

Bivariate Plots (top 50 Correlations) - Longitude Vs Near Ocean

Bivariate Plots (top 50 Correlations) - Population Vs Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Total Rooms Vs Longitude

Bivariate Plots (top 50 Correlations) - Population Per Household Vs Inland

Bivariate Plots (top 50 Correlations) - Median Income Vs Near Ocean

Bivariate Plots (top 50 Correlations) - Housing Median Age Vs Near Ocean

Bivariate Plots (top 50 Correlations) - Median House Value Vs Island

Bivariate Plots (top 50 Correlations) - Median Income Vs Households

Bivariate Plots (top 50 Correlations) - Total Bedrooms Vs 1h Ocean

Key Drivers

Median House Value

Median House Value - Feature Scores - Feature Correlation

Median House Value - Feature Importances - From Model

Median House Value - Pca Analysis

Median House Value - Pca Analysis - Pca Projection

Median House Value - Pca Analysis - Correlation With Dimension 2 (y)

Median House Value - Pca Analysis - Correlation With Dimension 1 (x)

Median House Value - Bivariate Plots

Median House Value - Bivariate Plots - Population Per Household

Median House Value - Bivariate Plots - Median Income

Median House Value - Bivariate Plots - Population

Median House Value - Bivariate Plots - Housing Median Age

Median House Value - Bivariate Plots - Latitude

Median House Value - Bivariate Plots - Near Bay

Median House Value - Bivariate Plots - Total Rooms

Median House Value - Bivariate Plots - Total Bedrooms

Median House Value - Bivariate Plots - Households

Median House Value - Bivariate Plots - Bedrooms Per Room

Median House Value - Bivariate Plots - Island

Median House Value - Bivariate Plots - Near Ocean

Median House Value - Bivariate Plots - Inland

Median House Value - Bivariate Plots - Rooms Per Household

Median House Value - Bivariate Plots - Longitude

Median House Value - Bivariate Plots - 1h Ocean